Systematic Evaluation of Convergence Criteria in Iterative Training for NLP

نویسندگان

  • Patricia Brent
  • Nathan David Green
  • Paul Breimyer
  • Ramya Krishnamurthy
  • Nagiza F. Samatova
چکیده

Natural Language Processing (NLP) tasks, such as Named Entity Recognition (NER), involve an iterative process of model optimization to identify different types of words or semantic entities. This optimization to achieve a more precise model becomes computationally difficult as the number of iterations increase. The small datasets available for training typically limit the models. Adding iterations on such sets to further optimize the model can often cause over-fitting, which generally leads to reduced performance. Therefore, the choice of convergence criteria is a critical step in robust and accurate model building. We evaluate different convergence criteria in terms of their robustness, stopping threshold selection, and independence from the training data size and entity. The underlying framework employs a limitedmemory Broyden-Fletcher-Goldfarb-Shanno (L-BFGS) parameter optimization in the context of Conditional Random Fields (CRF). This paper presents a convergence criterion for robust training irrespective of semantic types and data sizes with two-orders of magnitude reduction in stopping threshold for improved model accuracy and faster convergence. Additionally, we examine convergence with active learning to further reduce the training data and training time.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Solving systems of nonlinear equations using decomposition technique

A systematic way is presented for the construction of multi-step iterative method with frozen Jacobian. The inclusion of an auxiliary function is discussed. The presented analysis shows that how to incorporate auxiliary function in a way that we can keep the order of convergence and computational cost of Newton multi-step method. The auxiliary function provides us the way to overcome the singul...

متن کامل

High order quadrature based iterative method for approximating the solution of nonlinear equations

In this paper, weight function and composition technique is utilized to speeds up the convergence order and increase the efficiency of an existing quadrature based iterative method. This results in the proposition of its improved form from a two-point quadrature based method of convergence order ρ = 3 with efficiency index EI = 1:3161 to a three-point method of convergence order ρ = 8 with EI =...

متن کامل

A SIXTH ORDER METHOD FOR SOLVING NONLINEAR EQUATIONS

In this paper, we present a new iterative method with order of convergence eighth for solving nonlinear equations. Periteration this method requires three evaluations of the function and one evaluation of its first derivative. A general error analysis providing the eighth order of convergence is given. Several numerical examples are given to illustrate the efficiency and performance of the new ...

متن کامل

The Criteria for Evaluation of the Integration of Information and Communication Technology in the Curriculum: A Systematic Review

Objective: This study aimed to review the criteria for evaluating the integration of information and communication technology (ICT) in the curriculum, and given its significance, provide the necessary assessment recommendations. Material & Methods: This study was a theoretical-systematic review performed with keywords such as "integration," "evaluation," "Information and communication technolo...

متن کامل

Convergence of an Iterative Scheme for Multifunctions on Fuzzy Metric Spaces

Recently, Reich and Zaslavski have studied a new inexact iterative scheme for fixed points of contractive and nonexpansive multifunctions. In 2011, Aleomraninejad, et. al. generalized some of their results to Suzuki-type multifunctions.  The study of iterative schemes for various classes of contractive and nonexpansive mappings is a central topic in fixed point theory. The importance of Banach ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009